NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

GSAC: Leveraging Gaussian Splatting for Photorealistic Avatar Creation with Unity Integration

Zhang, Rendong; Watkins, Alexandra; Sarkar, Nilanjan (July 2025, IEEE; 2025 the 11th International Conference on Virtual Reality)

Photorealistic avatars have become essential for immersive applications in virtual reality (VR) and augmented reality (AR), enabling lifelike interactions in areas such as training simulations, telemedicine, and virtual collaboration. These avatars bridge the gap between the physical and digital worlds, improving the user experience through realistic human representation. However, existing avatar creation techniques face significant challenges, including high costs, long creation times, and limited utility in virtual applications. Manual methods, such as MetaHuman, require extensive time and expertise, while automatic approaches, such as NeRF-based pipelines often lack efficiency, detailed facial expression fidelity, and are unable to be rendered at a speed sufficent for real-time applications. By involving several cutting-edge modern techniques, we introduce an end-to-end 3D Gaussian Splatting (3DGS) avatar creation pipeline that leverages monocular video input to create a scalable and efficient photorealistic avatar directly compatible with the Unity game engine. Our pipeline incorporates a novel Gaussian splatting technique with customized preprocessing that enables the user of ”in the wild” monocular video capture, detailed facial expression reconstruction and embedding within a fully rigged avatar model. Additionally, we present a Unity-integrated Gaussian Splatting Avatar Editor, offering a user-friendly environment for VR/AR application development. Experimental results validate the effectiveness of our preprocessing pipeline in standardizing custom data for 3DGS training and demonstrate the versatility of Gaussian avatars in Unity, highlighting the scalability and practicality of our approach.
more » « less
Free, publicly-accessible full text available July 9, 2026
Instilling the perception of weight in augmented reality using minimal haptic feedback

https://doi.org/10.1038/s41598-024-75596-7

Watkins, Alexandra; Ghosh, Ritam; Ullal, Akshith; Sarkar, Nilanjan (December 2024, Scientific Reports)

Full Text Available
Every “Body” Gets a Say: An Augmented Optimization Metric to Preserve Body Pose During Avatar Adaptation in Mixed/Augmented Reality

https://doi.org/10.1109/TVCG.2024.3388376

Watkins, Alexandra; Ullal, Akshith; Sarkar, Nilanjan (July 2025, IEEE Transactions on Visualization and Computer Graphics)
A Universal Web-Based Tool for Multimodal Data Synchronization and Labeling

https://doi.org/10.1007/978-3-031-93965-5_18

Khan, Nibraas; Haan, Ruj; Shragge, Ingrid; Zilinskaite, Gabija; Plunk, Abigale; Staubitz, John; Rajaraman, Adithyan; Weitlauf, Amy; Sarkar, Nilanjan (January 2025, Springer Nature Switzerland)

Full Text Available
Pilot study of a real-time early agitation capture technology (REACT) for children with intellectual and developmental disabilities

Khan, Nibraas; Plunk, Abigale; Zheng, Zhaobo; Adiani, Deeksha; Staubitz, John; Weitlauf, Amy; Sarkar, Nilanjan Sarkar (October 2024, Digital health)

Objective: Children and adolescents with intellectual and developmental disabilities (IDD), particularly those with autism spectrum disorder, are at increased risk of challenging behaviors such as self-injury, aggression, elopement, and property destruction. To mitigate these challenges, it is crucial to focus on early signs of distress that may lead to these behaviors. These early signs might not be visible to the human eye but could be detected by predictive machine learning (ML) models that utilizes real-time sensing. Current behavioral assessment practices lack such proactive predictive models. This study developed and pilot-tested real-time early agitation capture technology (REACT), a real-time multimodal ML model to detect early signs of distress, termed “agitations.” Integrating multimodal sensing, ML, and human expertise could make behavioral assessments for people with IDD safer and more efficient. Methods: We leveraged wearable technology to collect behavioral and physiological data from three children with IDD aged 6 to 9 years. The effectiveness of the REACT system was measured using F1 score, assessing its usefulness at the time of agitation to 20s prior. Results: The REACT system was able to detect agitations with an average F1 score of 78.69% at the time of agitation and 68.20% 20s prior. Conclusion: The findings support the use of the REACT model for real-time, proactive detection of agitations in children with IDD. This approach not only improves the accuracy of detecting distress signals that are imperceptible to the human eye but also increases the window for timely intervention before behavioral escalation, thereby enhancing safety, well-being, and inclusion for this vulnerable population. We believe that such technological support system will enhance user autonomy, self-advocacy, and self-determination.
more » « less
Full Text Available
Exploring the Intersection of Autism, Theory of Mind, and Driving Performance in Novice Drivers

https://doi.org/10.1007/s10803-024-06526-9

Plunk, Abigale; Weitlauf, Amy_S; Warren, Zachary; Levin, Daniel; Sarkar, Nilanjan (August 2024, Journal of Autism and Developmental Disorders)

Abstract This study explores the intersection of Theory of Mind (ToM) abilities and driving performance among novice drivers, with a focus on autistic individuals. The purpose is to investigate how ToM deficits may impact driving behaviors and decision-making, ultimately informing the development of tailored interventions and training programs for autistic drivers. We conducted a series of driving simulations using a custom-built driving simulator, capturing multimodal data including driving performance metrics, attention allocation, and physiological responses. Participants were categorized based on NEPSY scores, which assess ToM abilities, and self-reported autism spectrum disorder (ASD) diagnosis. Driving tasks were designed to simulate real-world scenarios, particularly focusing on intersections and merging, where ToM skills are crucial for safe navigation. Our analysis revealed differences in driving behaviors among participants with varying ToM abilities as determined through the NEPSY. Participants with lower NEPSY scores exhibited less smooth driving behaviors, increased risk-taking tendencies, and differences in attention allocation compared to those with higher scores. Alternatively, individuals with ASD displayed comparable driving patterns overall. ToM abilities influence driving behaviors and decision-making, particularly in complex social driving scenarios. Tailored interventions addressing ToM deficits and stress management could improve driving safety and accessibility for autistic individuals. This study underscores the importance of considering social cognitive factors in driving education and licensure pathways, aiming for greater inclusivity and accessibility in transportation systems.
more » « less
From Lab to a Long-Term Care Facility: Lessons Learned from Field Deployment of Augmented Reality Telepresence System as an Interactive Communication Technology

https://doi.org/10.1109/ISMAR-Adjunct64951.2024.00146

Tauseef, Mahrukh; Ullal, Akshith; Watkins, Alexandra; Ingram, Zalen; Maxwell, Cathy; Tate, Judith; Juckett, Lisa; Mion, Lorraine C; Sarkar, Nilanjan (October 2024, IEEE)

Full Text Available
MicroXercise: A Micro-Level Comparative and Explainable System for Remote Physical Therapy

https://doi.org/10.1109/CHASE60773.2024.00017

Wang, Hanchen David; Khan, Nibraas; Chen, Anna; Sarkar, Nilanjan; Wisniewski, Pamela; Ma, Meiyi (June 2024, IEEE)

Full Text Available
An Iterative Participatory Design Approach to Develop Collaborative Augmented Reality Activities for Older Adults in Long-Term Care Facilities

https://doi.org/10.1145/3613904.3642595

Ullal, Akshith; Tauseef, Mahrukh; Watkins, Alexandra; Juckett, Lisa; Maxwell, Cathy A; Tate, Judith; Mion, Lorraine; Sarkar, Nilanjan (May 2024, ACM)

Full Text Available
Dialogue Act Classification via Transfer Learning for Automated Labeling of Interviewee Responses in Virtual Reality Job Interview Training Platforms for Autistic Individuals

https://doi.org/10.3390/signals4020019

Adiani, Deeksha; Colopietro, Kelley; Wade, Joshua; Migovich, Miroslava; Vogus, Timothy J.; Sarkar, Nilanjan (June 2023, Signals)

Computer-based job interview training, including virtual reality (VR) simulations, have gained popularity in recent years to support and aid autistic individuals, who face significant challenges and barriers in finding and maintaining employment. Although popular, these training systems often fail to resemble the complexity and dynamism of the employment interview, as the dialogue management for the virtual conversation agent either relies on choosing from a menu of prespecified answers, or dialogue processing is based on keyword extraction from the transcribed speech of the interviewee, which depends on the interview script. We address this limitation through automated dialogue act classification via transfer learning. This allows for recognizing intent from user speech, independent of the domain of the interview. We also redress the lack of training data for a domain general job interview dialogue act classifier by providing an original dataset with responses to interview questions within a virtual job interview platform from 22 autistic participants. Participants’ responses to a customized interview script were transcribed to text and annotated according to a custom 13-class dialogue act scheme. The best classifier was a fine-tuned bidirectional encoder representations from transformers (BERT) model, with an f1-score of 87%.
more » « less
Full Text Available

« Prev Next »

Search for: All records